Extraction of Statistical Terms and Co-occurrence Networks from Newspapers
نویسندگان
چکیده
In this paper, we automatically extract statistical terms and build their co-occurrence networks from newspapers. Statistical terms are expression of the measurements of statistics to watch the movements of phenomena; birth rates, public approval rating of the Cabinet and so on. In recent years, we have a vast amount of available information because of computerization and the technologies of making their overview and enhancement of their values are noticed. One of them is the technology of visualizing information of social trend and movements from newspapers. For visualizing trend information, there some approaches. In this paper, we take the approach of building networks of causal relations among the statistical terms. To extract statistical terms, we propose extraction method using suffixes. To extract causal relations among statistical terms, we first extract co-occurrence relations and next show them with the networks. We can extract many statistical terms with high accuracy by our method and find interesting links among some statistical terms by our co-occurrence networks.
منابع مشابه
Keyword Extraction from a Single Document using Word Co-occurrence Statistical Information
We present a new keyword extraction algorithm that applies to a single document without using a corpus. Frequent terms are extracted first, then a set of cooccurrence between each term and the frequent terms, i.e., occurrences in the same sentences, is generated. Co-occurrence distribution shows importance of a term in the document as follows. If probability distribution of co-occurrence betwee...
متن کاملThe analysis of co-citation and word co-occurrence networks of Iranian articles in the field of dentistry
Background and Aims: Dentistry is an important profession ensuring the health of body and soul, and has a special place in the scientific productions of medical disciplines. The purpose of this study was to analyze the co-citation and word co-occurrence of Iranian research papers in the field of dentistry based on indexed documents in Web of Science from 2014 to 2018. Materials and Methods:...
متن کاملکندوکاوی در انعکاس موضوعات کتابداری و اطلاع رسانی در روزنامه های کثیرالانتشار سال 1390
Purpose: The objective of this research is to identify the occurrence rate , publication style and type of topics related to library and information science that were found in widely read newspapers in 1390 (2011). Methodology: In this research, the content analysis was used in order to investigate library and information science topics in widely read Iranian newspapers. The The statistical su...
متن کاملSecond-Order Statistical Texture Representation of Asphalt Pavement Distress Images Based on Local Binary Pattern in Spatial and Wavelet Domain
Assessment of pavement distresses is one of the important parts of pavement management systems to adopt the most effective road maintenance strategy. In the last decade, extensive studies have been done to develop automated systems for pavement distress processing based on machine vision techniques. One of the most important structural components of computer vision is the feature extraction met...
متن کاملMapping the Scientific Structure of Iranian Brucellosis Researches Using the Co-authorship and Co-occurrence Network Analysis
Background and Objective: The evaluation of the publishing trend of articles in various scientific fields provides an insight into the efforts of researchers in the field of knowledge. Accordingly, the present study has evaluated and analyzed the scientific publications on brucellosis conducted by Iranian researchers using scientometrics methods and analysis of social networks. Methods: The pr...
متن کامل